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(54) Method and apparatus for adaptively scaling motion vector information 



(57) A method (200) and apparatus (100) for reduc- 
ing memory and memory bandwidth requirements in an 
MPEG-like decoder by compressing (300) image infor- 
mation prior to storage such that a reduced resolution 
image information frame is stored and subsequently uti- 
lised by, e.g., a motion compensation module of the de- 



coder. The invention seeks to responsively processes 
(235; 245; 250) motion vector information in a manner 
consistent with the amount of compression imparted to 
a predicted image information frame, and the type (225) 
of prediction employed in forming the predicted informa- 
tion frame. 



-205 



NEXT PIXEL 
BLOCK 



CM 

< 

N 
O 
CD 

m 
to 



Q. 
HI 



220 



225 




JL 



235 



MWr » (MW+1)(SCALE FACTOR) - 1 
MVHr « (MVH)(SCALE FACTOR) 





NO 


1 


' r 



240 



/mv pointing fromNJ^L 
\bot to top held?/ 



JL 



245 



MWr m (MW-1 )(SCAL£ FACTOR) + 1 
MVHr « MVH * SCALE FACTOR 



NO 



JL 



250 



MWr - (MW)(SCALE FACTOR) 
MVHr = (MVHKSCALE FACTOR) 



_T 255 
J TRUNCATE MWr I 
~ TRUNCATE MVHrj^ 



J 



FIG. 2 



Prhted by Jouve, 75001 RARIS <FH) 



E?O0Q9^SG7 (file ; ! S . *P AT E N T F I LE S\Vi xs Syste ns (1 459 JGL) W [XS . Pa te nts^yiXS 0 1 0C\R e fere nces\EP00095560 7 cpc] Page 2 of 12 



EP 0 955 607 A2 

Description 

[0001] The present invention relates to a method and apparatus for adaptively scaling motion vector information. An 
illustrative embodiment of the invention relates to communications systems generally and, more particularly, the in- 
5 vention relates to a method and apparatus for adaptively scaling motion vector information in an information stream 
decoder, such as an MPEG-like video decoder. 

[0002] In several communications systems the data to be transmitted is compressed so that the available bandwidth 
is used more efficiently. For example, the Moving Pictures Experts Group (MPEG) has promulgated several standards 
relating to digital data delivery systems. The first, known as MPEG-t refers to ISO/IEC standards 11172 and is incor- 
10 porated herein by reference. The second, known as MPEG-2, refers to ISO/IEC standards 1 3818 and is incorporated 
herein by reference. A compressed digital video system is described in the Advanced Television Systems Committee 
(ATSC) digital television standard document A/53, and is incorporated herein by reference. 

[0003] The above-referenced standards describe data processing and manipulation techniques that are well suited 
to the compression and delivery of video, audio and other information using fixed or variable length digital communi- 

is cations systems. In particular, the above-referenced standards, and other "MPEG-like" standards and techniques, 
compress, illustratively, video information using intra-frame coding techniques (such as run-length coding, Huffman 
coding and the like) and inter-frame coding techniques (such as forward and backward predictive coding, motion com- 
pensation and the like). Specifically, in the case of video processing systems, MPEG and MPEG-like video processing 
systems are characterised by prediction-based compression encoding of video frames with or without intra- and/or 

20 inter-frame motion compensation encoding. 

[0004] In a typical MPEG decoder, predictive coded pixel blocks (i.e., blocks that comprise one or more motion 
vectors and a residual error component) are decoded with respect to a reference frame (i.e., an anchor frame). The 
anchor frame is stored in an anchor frame memory within the decoder, typically a dual frame memory. As each block 
of an anchor frame is decoded, the decoded block is coupled to a first portion of the dual frame memory. When an 

25 entire anchor frame has been decoded, the decoded blocks stored in the first portion of the dual frame memory are 
coupled to a second portion of the dual frame memory. Thus, the second portion of the dual frame memory is used to 
store the most recent full anchor frame, which is in turn used by a motion compensation portion of the decoder as the 
reference frame for decoding predictive coded blocks. 

[0005] To reduce the amount of memory required to implement the above anchor frame memory, it is known to 
30 compress (i.e. , resize) anchor frame image information prior to storage in the anchor frame memory. To ensure accurate 
prediction using such resized reference image information, it is necessary to correspondingly resize the prediction 
motion vectors that will utilise the resized reference image information. Present arrangements providing such resizing 
of images and related motion vector information do not produce satisfactory results under ail conditions. Specifically, 
present arrangements do not function properly in the presence of field prediction encoded macroblocks including inter- 
35 field motion vectors. 

[0006] Therefore, it is seen to be desirable to provide a method and apparatus that significantly reduces the memory 
and memory bandwidth required to decode a video image while substantially retaining the quality of a resulting full- 
resolution or downsized video image. Specifically, it is seen to be desirable to provide such a reduction in memory and 
memory bandwidth even in the presence of field-predictive encoded macroblocks. 

40 [0007] An embodiment of the present invention seeks to provide a method and apparatus for reducing memory and 
memory bandwidth requirements in an MPEG-like decoder. Memory and memory bandwidth requirements are reduced 
by compressing image information prior to storage such that a reduced resolution image information frame is stored 
and subsequently utilised by, e.g., a motion compensation module of the decoder. An embodiment of the present 
invention processes motion vector information in a manner consistent with the amount of compression imparted to a 

4S predicted image information frame, and the type of prediction employed in forming the predicted information frame. 
[0008] One aspect of the present invention provides in a block-based system for decoding a compressed information 
stream including predicted pixel blocks having associated motion vector information, a method for adapting said motion 
vector information to a scaling factor associated with scaled pixel block reference information, comprising the steps 
of: identifying an encoding mode of a predicted pixel block; scaling, using said scaling factor, a horizontal displacement 

50 parameter of each motion vector associated with said predicted pixel block; and in the case of a field prediction encoding 
mode including an inter-field motion vector prediction: imparting, to a vertical displacement parameter of said motion 
vector associated with said predicted pixel block, a first offset; scaling, using said scaling factor, said offset vertical 
displacement parameter; and imparting, to said scaled offset vertical displacement parameter, a second offset. 
[0009] For a better understanding of the present invention, reference will now be made by way of example to the 

55 accompanying drawings in which: 

Figure 1 depicts an embodiment of an MPEG-like decoder according to an embodiment of the invention; 
Figure 2 depicts a flow digram of a motion vector scaling routine according to a further embodiment of the invention 
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and suitable for use in the MPEG-like decoder of Figure 1 ; 

Figure 3A and Figure 38 are flow diagrams of image compression routines suitable for use in the MPEG-like 
decoder of Figure 1; 

Figure 4A is graphical depiction of an 8x8 non-interlaced pixel block having an associated frame -prediction mode 
5 motion vector; 

Figure 4B is a graphical description of a scaled version of the 8x8 non-interlaced pixel block and associated motion 
vector of Figure 4A; 

Figure 5A is graphical depiction of an 8x8 interlaced pixel block having an associated field-prediction mode motion 
vector; and 

10 Figure 5B is a graphical description of a scaled version of the 8x8 interlaced pixel block and associated motion 

vector of Figure 5A. 

[0010] To facilitate understanding, identical reference numerals have been used, where possible, to designate iden- 
tical elements that are common to the figures. 
'5 [0011] Embodiments of the present invention will be described within the context of a video decoder, illustratively an 
MPEG-2 video decoder. However, it will be apparent to those skilled in the art that embodiments of the present invention 
are applicable to any video processing system, including those systems adapted to DVB, MPEG-1 , MPEG-2 and other 
information streams. Embodiments of the present invention are particularly well suited to systems utilising inter-field 
motion vector prediction. 

20 [001 2] Specifically, embodiments of the present invention will be primarily described within the context of an MPEG- 
like decoding system that receives and decodes a compressed video information stream IN to produce a video output 
stream OUT. Embodiments of the present invention operate to reduce memory and memory bandwidth requirements 
in the MPEG-like decoder by compressing image information prior to storage such that a reduced resolution image 
information frame is stored and subsequently utilised by, e.g., a motion compensation module of the decoder. Embod- 

25 iments of the present invention process motion vector information in a manner consistent with the amount of compres- 
sion imparted to a predicted image information frame, and the type of prediction employed in forming the predicted 
information frame. 

[0013] Figure 4A is graphical depiction of an 8x8 non-interlaced pixel block having an associated frame-prediction 
mode motion vector. FIGURE 4B is a graphical description of a scaled version (SCALE FACTOR = 2) of the 8x8 non- 
30 interlaced pixel block (i.e., a 4x4 non-interlaced pixel block) and associated motion vector of FIGURE 4A. The motion 
vector associated with the 8x8 pixel block of FIGURE 4A has a horizontal displacement of 3.5 pixels, and a vertical 
displacement of four lines. The corresponding scaled motion vector of FIGURE 4B has, appropriately, a horizontal 
displacement of 1 .75 pixels, and a vertical displacement of two lines. Thus, both pixel and motion vector information 
have scaled appropriately in the representations of FIGURE 4A and FIGURE 4B. 
as [0014] As depicted in above with respect to FIGURE 4A and FIGURE 4B, if the only prediction mode is used is a 
frame prediction mode, than the same scaling factor is used to scale the reference pixel blocks and the motion vectors 
used to form predicted pixel blocks using the scaled reference blocks (e.g., by the motion compensation module 116 
of FIGURE 1). 

[001 5] FIGURE 5A is graph icai depiction of an 8x8 interlaced pixel block having an associated field-prediction mode 
motion vector. FIGURE 5B is a graphical description of a scaled version (SCALE FACTOR = 2) of the 8x8 interlaced 
pixel block (i. e. , a 4x4 interlaced pixel block) and associated motion vector of FIGU RE 5A. The motion vector associated 
with the 8x8 pixel block of FIGURE 5A comprises a (0,0) motion vector. That is, the motion vector points from the first 
line in the first field to the first line in the second field. Furthermore, since the motion vector is coded as a (0,0) motion 
vector, a simple scaling of the motion vector will result in a value of zero. That is, the resulting scaled motion vector 
45 will also be a (0,0) motion vector. 

[0016] When using the scaled (0,0) motion vector to predict the motion of a scaled macroblock, the resulting prediction 
will be incorrect. This is because the scaled motion vector will point from the first line in the first field to the first line in 
the second field. However, since the macroblock has been scaled, it is likely that the motion vector should point to a 
different line. 

[0017] Referring now to FIGURE 5B (a 2:1 scaled version of FIGURE 5A), the pixel domain information has been 
properly scaled, but the (0,0) motion vector value is representative of an incorrect vertical displacement of the motion 
vector. If properly interpreted, the scaled motion vector value would result in a motion vector that pointed to a hart-pel 
above the first line of the second field. However, since a (0,0) motion vector was scaled, resulting in a (0 0) motion 
vector, the scaled motion vector points to the first line in the second field. Thus, in attempting to scale the motion vector 
by a factor of two, the vertical displacement parameter of the motion vector has been effectively doubled. As such the 
scaled motion vector is not appropriate to the scaled pixel information. As such, any predictions using this motion vector 
will result in, e.g., undesirable visual artefacts upon presentation of the decoded images. 

[0018] In view of the foregoing discussion, it can be readily appreciated that, in the case of inter-field motion vector 
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prediction, the 'divide by 2" approach or, more generally, the "scale motion vectors as the pixel Information is scaled" 
approach, results in a vertical displacement shift proportional to the scaling ratio used, and which depends of the parity 
of the source and destination fields. That is, in the case of 2:1 compression, such as depicted in Figures 5A and 5B, 
a one line shift of the 'actual' motion vector occurs. This same shifting by an appropriate constant factor will occur 

s when resizing any motion vector within the context of inter-field motion vector prediction. 

[0019] To compensate for this shifting of motion vectors when using inter-field motion vector prediction, embodiments 
of the present invention utilise a scaling factor representative of the ratio between the two sampling distances. For 
example, in the case of a scaling factor of two (i.e., 2:1 compression), the vertical component of the motion vector is 
resized such that the appropriate scaled vertical displacement of the motion vector is utilised 

io [0020] It is important to note that the vertical displacement shift described above differs for motion vectors pointing 
from top fields to bottom fields and from motion vectors pointing from bottom fields to top fields. That is, in a case of 
a motion vector pointing from a top field to a bottom field, a scaled motion vector will have a positive shift in vertical 
displacement. Therefore, for the case of a motion vector pointing from a top field to a bottom field, in addition to scaling 
the motion vector according to the pixel scaling factor, the positive vertical displacement must be offset. Similarly, in a 

is case of a motion vector pointing from a bottom field to a top field, the scaled motion vector will have a negative vertical 
displacement. Therefore, for the case of a motion vector pointing from a bottom field to a top field, in addition to scaling 
the motion vector according to the pixel scaling factor, the negative vertical displacement must be offset. 
[0021] FIGURE 1 depicts an embodiment of an MPEG-like decoder 100 according to the invention. Specifically, the 
decoder 100 of FIGURE 1 receives and decodes a compressed video information stream IN to produce a video output 

20 stream OUT The video output stream OUT is suitable for coupling to, e.g. , a display driver circuit within a presentation 
device (not shown). 

[0022] The MPEG-like decoder 100 comprises an input buffer memory module 111 , a variable length decoder (VLD) 
module 112, an inverse quantizer (IQ) module 11 3, an inverse discrete cosine transform (IDCT) module 114, a summer 
115, a motion compensation module 116. an output buffer module 118, an anchor frame memory module 117, a pixel 

ss processor 1 20 and a motion vector (MV) processor 1 30. 

[0023] The input buffer memory module 1 1 1 receives the compressed video stream IN, illustratively a variable length 
encoded bitstream representing, e.g., a high definition television signal (HDTV) or standard definition television signal 
(SDTV) output from a transport demultiplexer/decoder circuit (not shown). The input buffer memory module 1 11 is used 
to temporarily store the received compressed video stream IN until the variable length decoder module 112 is ready 

30 to accept the video data for processing. The VLD 112 has an input coupled to a data output of the input buffer memory 
module 111 to retrieve, e.g., the stored variable length encoded video data as data stream S1 . 
[0024] The VLD 112 decodes the retrieved data to produce a constant length bit stream S2 comprising quantized 
prediction error DCT coefficients, a motion vector stream MV and a block information stream DATA. The IQ module 
113 performs an inverse quantization operation upon constant length bit stream S2 to produce a bit stream S3 com- 
as prising quantized prediction error coefficients in standard form. The IDCT module 114 performs an inverse discrete 
cosine transform operation upon bit stream S3 to produce a bitstream S4 comprising pixel-by-pixel prediction errors. 
[0025] The summer 1 1 5 adds the pixel-by-pixel prediction error stream S4 to a motion compensated predicted pixel 
value stream S6 produced by the motion compensation module 116. Thus, the output of summer 1 1 5 is, in the exemplary 
embodiment, a video stream S5 comprising reconstructed pixel values. The video stream S5 produced by summer 

40 11 5 is coupled to the pixel processor 1 20 and the output buffer module 118. 

[0026] The pixel processor 120 compresses the video stream S5 according to a scaling factor SF to produce a 
compressed video stream S5* having a compression ration of 1 :SF. The pixel processor 1 20 operates on a pixel block 
by pixel block basis (e.g., a 4x4, 4x8 or 8x8 pixel block) to compress each pixel block forming an anchor frame such 
that a resulting compressed anchor frame is provided to the anchor frame memory as compressed video stream S5\ 

45 Thus, the memory requirements of anchor frame memory module 117 are reduced by a factor of SF. 

[0027] In one embodiment of the pixel processor 1 20, a pixel block is compressed by subjecting the pixel block to a 
discrete cosine transform (DCT) to produce a DCT coefficient block. A portion (typically high order coefficients) of the 
DCT coefficient block is then truncated. The remaining DCT coefficients are then subjected to an inverse DCT to 
produce a reduced resolution pixel block. The amount of reduction in resolution is determined by the number of DCT 

so coefficients used to reconstruct the truncated pixel block. 

[0028] I n another embodiment of the pixel processor 1 20, an 8x8 pixel block is subjected to a DCT process to produce 
a respective 8x8 DCT coefficient block. If half of the DCT coefficients are truncated, and the remaining DCT coefficients 
are subjected to the IDCT processing, then the resulting pixel block will have approximately half the resolution (i.e., a 
2:1 compression ratio) of the initial pixel block (i.e., a 4x8 or 8x4 pixel block). Similarly, if three fourths of the DCT 

ss coefficients are truncated, and the remaining DCT coefficients are subjected to the I DCT processing, then the resulting 
pixel block will have approximately one fourth the resolution (i.e., a 4:1 compression ratio) of the initial pixel block (i. 
e., a 4x4 pixel block). 

[0029] In another embodiment of the pixel processor 120, a decimation or subsarnpling process is used. That is, a 
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particular compression ratio is achieved by selectively removing pixels from an image represented by pixel information 
within video stream S5. For example, to achieve a 4:1 compression ratio ol an image, every other scan line of an image 
is removed, and every other pixel of the remaining scan lines is removed. In this embodiment, pixel processor 120 
operates to sub-sample, or decimate, the pixel information within video stream S5 to effect a resizing (i.e. , downsizing) 
5 of the video image represented by the pixel data. 

[0030] The anchor frame memory module 1 1 7 receives and stores the compressed video stream S5\ Advantageous- 
ly, the size of the anchor frame memory module 117 may be reduced by an amount consistent with the compression 
ratio utilised by the pixel processor 1 20. 

[0031] The motion vector processor 1 30 receives the motion vector stream MV and block information stream DATA 
io from the VLD 112. The motion vector stream MV comprises motion vector information to be used by the motion com- 
pensation module 116 to predict individual macroblocks based upon image information stored in the anchor frame 
memory module. However, since the image information stored in the anchor frame memory module 117 has been 
scaled by the pixel processing unit 120 as described above, it is also necessary to scale motion vector data used to 
predict macroblocks using the scaled pixel information. The scaled motion vectors MVare coupled to the motion corn- 
's pensation module 116 via path MV. 

[0032] The motion compensation module 116 accesses the compressed (i.e., scaled) image information stored in 
memory module 1 1 7 via signal path S7 1 and the scaled motion vector(s) MV to produce a scaled predicted macroblock. 
That is, the motion compensation module 116 utilises one or more stored anchor frames (e.g., the reduced resolution 
pixel blocks generated with respect to the most recent l-f rame or P -frame of the video signal produced at the output 
20 of the summer 115), and the motion vector(s) MV received from the motion vector processor 130, to calculate the 
values for each of a plurality of scaled predicted macroblocks forming a scaled predicted information stream. 
[0033] Each scaled predicted macroblock is then processed by the motion compensation module 1 16 or by an inverse 
pixel processing module (not shown) following the motion compensation module 11 6 in a manner inverse to the process- 
ing of the pixel processor 120. For example, in the case of the pixel processor 120 performing a down-sampling or 
ss decimation of the video stream S5 produced by summer 115, the motion compensation module 116 performs an up- 
sampling or interpolation of the scaled predicted macroblock to produce a full resolution predicted macroblock. Each 
of the full resolution predicted macroblock are then coupled to an input of adder 115 as motion compensated predicted 
pixel value stream S6. 

[0034] The operation of the motion vector processor 1 30 will now be described in more detail with respect to FIGURE 
30 2. FIGURE 2 depicts a llow diagram of a motion vector scaling routine 200 according to an embodiment of the present 
invention and suitable for use in the MPEG-like decoder 100 of FIGURE 1. Specifically, FIGURE 2 depicts a flow 
diagram of a motion vector scaling routine 20O suitable for use in the motion vector processor 130 of the MPEG-like 
decoder 100 of FIGURE 1. 

[0035] The motion vector scaling routine 200 operates to scale motion vectors associated with a predicted frame (i. 

35 e. , a P-frame or B-f rame) to be processed by the motion compensation module 1 1 6. As previously discussed, to properly 
reconstruct a predicted macroblock using such a reduced resolution anchor frame, it is necessary to appropriately 
scale the motion vectors associated with the predicted macroblock. The motion vector scaling routine 200 adaptively 
scales the motion vectors) in response to the scaling factor used by the pixel processor 120 and the type of motion 
compensation (i.e., frame mode, intra-field mode or inter-field mode) originally used to from the predicted macroblock. 

40 [0036] The motion vector scaling routine 200 is entered at step 205, when, e.g., a predicted macroblock to be decoded 
is received by the variable length decoder 112, which responsively extracts motion vector(s) MV and motion vector 
mode information DATA from the received macroblock. The motion vector(s) MV and motion vector mode information 
DATA is coupled to the motion vector processor 130, as previously described. The routine 200 then proceeds to step 
225. 

45 [0037] At step 225 a query is made as to whether the motion vector(s) MV associated with the received macroblock 
are associated with a field prediction mode. That is, a query is made as to whether motion vector mode information 
DATA identifies the prediction methodology used for the received macroblock as the field prediction mode. For example, 
in the case of an MPEG-2 macroblock, a field-motion-type field within a header portion ol the macroblock may be 
examined. If the query at step 225 is answered negatively, then the routine 200 proceeds to step 250. If the query at 

so step 225 is answered affirmatively, then the routine 200 proceeds to step 230. 

[0038] At step 250 the vertical and horizontal displacement components of the received motion vector(s) are scaled 
per equations 1 and 2 (below), where: 

M W is the vertical displacement component of the received motion vector; 
ss MVH is the horizontal displacement component of the received motion vector; 

MWr is the scaled vertical displacement component of the motion vector; 
MVHr is the scaled horizontal displacement component of the motion vector; and 

SCALE FACTOR is the scaling factor used by, e.g., pixel processor 120 to scale the pixel blocks forming the 
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reference frame. 

[0039] After scaling the vertical and horizontal displacement components of the received motion vector(s) per equa- 
tions 1 and 2, the routine 200 proceeds to step 255. 

MWr = MWxSCALEF ACTOR (equation 1 ) 

MVHr = MVHxSCALEFACTOR (equation 2) 

[0040] At step 230 a query is made as to whether the received motion vector information comprises a motion vector 
pointing from a top field to a bottom field. If the query at step 230 is answered negatively, then the routine 200 proceeds 
to step 240. If the query at step 230 is answered affirmatively, then the routine 200 proceeds to step 235, where the 
vertical and horizontal displacement components of the received motion vector(s) are scaled per equations 3 (below) 
and 2 (above). The routine 200 then proceeds to optional step 255. 

MWr = [(MW+ Ipc(SCALEFACTOR)]- 1 (equation 3) 



[0041] At step 240 a query is made as to whether the received motion vector information comprises a motion vector 
pointing from a bottom field to a top field. If the query at step 240 is answered negatively, then the routine 200 proceeds 
to step 250. If the query at step 240 is answered affirmatively, then the routine 200 proceeds to step 245, where the 
vertical and horizontal displacement components of the received motion vector(s) are scaled per equations 4 (below) 
25 and 2 (above). The routine 200 then proceeds to optional step 255. 

MWr = [(MW- 1)x(SCALEFACTOR)]+ 1 (equation 4) 



30 [0042] At optional step 255 the scaled vertical (MWr) and horizontal (MVHr) displacement components of the re- 
ceived motion vector(s) are truncated to conform to, e.g., the half pel resolution of an MPEG-like decoding system. 
Alternatively, the MPEG-like decoder may keep the increased resolution of the motion vectors by utilising a finer pre- 
diction grid or co-ordinate system. The routine 200 then proceeds to step 220, to await reception of the next predicted 
pixel block by the VLD 112. 

35 [0043] FIGURE 3A depicts a pixel scaling routine 300A suitable for use in the pixel processor 120 of FIGURE 1 The 
pixel scaling routine 300A is entered at step 305, when a pixel block, illustratively an 8x8 pixel block is received by 
pixel processor 1 20 via video stream S5. The routine 300A then proceeds to step 31 0, where a discrete cosine transform 
(DCT) is performed on the received pixel block. For example, in the case of an 8x8 pixel block, a two dimensional DCT 
(or a plurality of one dimensional DCTs) is performed on the received pixel block to produce an 8x8 DCT coefficient 
40 block. The routine 300A then proceeds to step 31 5. 

[0044] At step 315 a plurality of DCT coefficients are truncated per the scaling factor. Thus, in the case of a scaling 
factor of two (i. e. , 2: 1 compression) half of the DCT coefficient (typically the higher order DCT coefficients) are truncated. 
Similarly, in the case of a scaling factor of four (i.e., 4:1 compression) three fourths of the (higherorder) DCT coefficients 
are truncated. The routine 300A then proceeds to step 320. 

[0046] At step 320 an inverse DCT is performed on the remaining DCT coefficients to produce a reconstructed pixel 
block comprising a subset of the pixel information within the received pixel block. For example, in the case of an 8x8 
pixel block undergoing 2:1 compression, the 32 DCT coefficients representing the higher vertical or horizontal spatial 
frequency information of the received pixel block are truncated at step 315. The remaining 32 DCT coefficients are 
subjected to the IDCT processing at step 320 to produce a 32 pixel block (i.e., a 4x8 or 8x4 pixel block). In a case of 
4:1 compression of a received 8x8 pixel block, where all DCT coefficients except the 16 lower frequency DCT coeffi- 
cients truncated, the 16 DCT coefficients representing the lower vertical and horizontal spatial frequency information 
of the received pixel block are subjected to an inverse DCT process to produce a 4x4 pixel block. The routine 300A 
then proceeds to step 325, where it is exited. 

[0046] FIGURE 38 represents an alternate embodiment of the pixel scaling routine 300 of pixel processor 120 
Specifically, the routine 300b of FIG 3B is entered at step 330, when a pixel block is received by pixel processor 120 
The routine 300B proceeds to step 333, where the received pixel block is low pass filtered, and to step 335, where the 
received pixel block is decimated or sub-sampled according to the scale factor to achieve an appropriate compression 
ratio. For example, pixels and/or lines of pixels are deleted from the video information stream S5 to produce a reduced 
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pixel (i.e., compressed) video stream S5\ 

[0047] While embodiments of the present invention have been described primarily in terms of scaling motion vectors 
and pixel domain information by a factor of two, it must be noted thai the embodiments are well suited to other scaling 
factors (integer and non-integer). Moreover, while embodiments of the present invention have been described primarily 

5 in terms of scaling down (i.e., reducing pixel domain information prior to storage), the embodiments are well suited to 
scaling up (i.e., increasing pixel domain information). Such scaling up of pixel domain information and motion vector 
information is especially applicable to applications requiring the presentation of low resolution image information using 
a high resolution display device. For example, the presentation of standard definition television (SDTV) on a high 
definition television (HDTV) display device. One skilled in the art and informed by the teachings of the present invention 

10 will readily devise additional and various modifications to the above-described embodiments of the invention. 

[0048] The present invention can be embodied in the form of computer-implemented processes and apparatuses 
for practicing those processes. The present invention also can be embodied in the form of computer program code 
embodied in tangible media, such as floppy diskettes, CD-ROMs, hard drives, or any other computer readable storage 
medium, wherein, when the computer program code is loaded into and executed by a computer, the computer becomes 

'5 an apparatus for practicing the invention. The present invention can also be embodied in the form of computer program 
code, for example whether stored in a storage medium, loaded into and/or executed by a computer, or transmitted 
over some transmission medium, such as over electrical wiring or cabling, through fibre optics, or via electromagnetic 
radiation, wherein, when the computer program code is loaded into and executed by a computer, the computer becomes 
an apparatus for practicing the invention. When implemented on a general-purpose microprocessor, the computer 

20 program code segments configure the microprocessor to create specific logic circuits. 

[0049] Although various embodiments which incorporate the teachings of the present invention have been shown 
and described in detail herein, those skilled in the art can readily devise many other varied embodiments that still 
incorporate these teachings. 

25 

Claims 



1. In a block-based system for decoding a compressed information stream including predicted pixel blocks having 
associated motion vector information, a method for adapting said motion vector information to a scaling factor 
associated wfth scaled pixel block reference information, comprising the steps of: 

identifying (225) an encoding mode of a predicted pixel block; 

scaling (235; 245; 250), using said scaling factor, a horizontal displacement parameter of each motion vector 
associated with said predicted pixel block; and 

in the case of a field prediction encoding mode including an inter-field motion vector prediction: 
imparting (235; 245), to a vertical displacement parameter of said motion vector associated with said predicted 
pixel block, a first offset; 

scaling (235; 245), using said scaling factor, said offset vertical displacement parameter; and 
imparting (235; 245), to said scaled offset vertical displacement parameter, a second offset. 



35 



40 



2. The method of claim 1 , further comprising the step of: 

in the case of a field prediction encoding mode not including an inter-field motion vector prediction: 
scaling (250), using said scaling factor, said vertical displacement parameter of said motion vector associated 
46 with said predicted pixel block. 

3. The method of claim 1 , wherein: 

if said inter-field motion vector prediction comprises a top field prediction into a bottom field (230). said first 
60 offset comprises a positive offset; and 

if said inter-field motion vector prediction comprises a bottom field prediction into a top field (240), said first 
offset comprises a negative offset. 



ss 



4. The method of claim 1 , wherein said first and second offsets have the same magnitude and opposite polarities. 

5. The method of claim 1 , further comprising the step of: 

truncating (255), to a predetermined level of accu racy, said scaled vertical and horizontal displacement parameters. 



7 



L'-R/tf095jto0? [te,//S \PATENT FJLESMxs Systems (1459 JGUWIXS . PaiertsViXS010ClRefere i ices i EP0009b5GC/ cpc] Page B of 12 



EP 0 955 607 A2 

6. The method of claim 1 , wherein said scaled pixel block reference information is produced according to the steps of: 

performing (310) a discrete cosine transform (DCT) operation on an unsealed reference pixel block to produce 
a corresponding DCT coefficient block; 
* truncation (315) a portion of said DCT coefficient block to produce a reduced DCT coefficient block; and 

performing (320) an inverse DCT operation on said reduced DCT coefficient block to produce said scaled pixel 
block. 

7. The method of claim 1 , wherein said scaled pixel block reference information is produced according to the steps of: 

10 

low pass filtering (333) an unsealed reference pixel block to produce a reduced frequency relerence pixel 
block; and 

decimating (335) said reduced frequency reference pixel block to produce said scaled pixel block, 
w 8. I n a video decoder, apparatus comprising: 

a pixel processor (1 20), for receiving decoded reference pixel blocks and producing therefrom scaled reference 
pixel blocks according to a scaling factor; and 

a motion vector processor ( 1 30), for receiving motion vector information associated with a predicted pixel block 
20 and producing therefrom a scaled motion vector according to said scaling factor; wherein 

said motion vector processor, in the case of a said pixel blocks being encoded using a field prediction encoding 
mode including an inter-field motion vector prediction, scaling said motion vector information associated with said 
predicted pixel block by imparting a first offset to a vertical displacement parameter of said motion vector, scaling 
25 said vertical displacement parameter and a horizontal displacement parameter of said motion vector according to 

said scaling factor, and imparting, to said scaled offset vertical displacement parameter, a second offset. 

9. The apparatus of claim 8, wherein: 

said motion vector processor, in the case of a said pixel blocks not being encoded using a field prediction encoding 
a? mode including an inter-field motion vector prediction, scaling said motion vector information associated with said 

predicted pixel block by scaling said vertical displacement parameter of said motion vector and said horizontal 
displacement parameter of said motion vector according to said scaling factor. 

10. The apparatus of claim 8, wherein: 

35 

if said inter-field motion vector prediction comprises a top field prediction into a bottom field, said first offset 
comprises a positive offset; and 

if said inter-field motion vector prediction comprises a bottom field prediction into a top field, said first offset 
comprises a negative offset. 

40 

11. The apparatus of claim 8, wherein: 

said first and second offsets have the same magnitude and opposite polarities. 



8 



EP00095S507 lfileJ/S_ 'PATENT FILESWixs Systems (1459 JGUMXS Pltent$WS010C'JR^eienoes»£PCWW5560 1 cpcj g o! 1 2 



EP 0955 607 A2 



IN 



r 



m 



112 



MV 



r 



.113 



INPUT 


S1 
— ► 


VLD 


S2 


IQ 


S3 


IDCT 




BUFFER 











DATA 



MV PROCESSOR 



MV SCALING 
ROUTINE 2QQ 



^7 



MV 



MOTION 
COMPENSATOR 



30 



S/ 




OUT 



117 



ANCHOR 
FRAME 
MEMORY 



PIXEL PROCESSOR 



PIXEL SCALING 
ROUTINE ^22 



S5' 



FIG. 1 



9 



BPOQQXmO ' (Tie '/S LATENT F LESMxs Systems (145? JGL^VIXS PatentsWiX $01 OCiReferences ; €P: 009^560 V cpc] r-gge iu of li 



EP0 955 607 A2 



205 




200 



235 



MWr = (MW+1)(SCALE FACTOR) - 1 
MVHr = (MVH)(SCALE FACTOR) 



245 



MWr = (MVV-1)(SCALE FACTOR) + 1 
MVHr = MVH * SCALE FACTOR 



MWr = (MW)(SCALE FACTOR) 
MVHr = (MVH)(SCALE FACTOR) 



/- 25 0 ____ _/l 255 
J TRUNCATE Mw7' 



TRUNCATE MVHr' 
L . j 



FIG. 2 



10 



h?0009i(5to; [file MS .PATENT FiLESW xs Systems (1459 JGL)\VIXS PatenissyiXSpiOOReferencesVE.POOOOSSeO"/ cpc] 

EP 0 955 607 A2 



Page n of 12 



305 



PERFORM OCT 
ON PIXEL BLOCK 



310 



I 



TRUNCATE HIGH ORDER 
DCT COEFFICIENTS 
PER SCALE FACTOR 



315 



PERFORM IDCTON 
REMAINING DCT 
COEFFICIENTS 




320 



EXIT 

FIG. 3A 



325 



aooA. 




330 



LOW PASS FILTER 


333 






DECIMATE PIXEL 

BLOCK PER 
SCALE FACTOR 


300B 


1 


L_x-~ 340 



EXIT 

FIG. 3B 



11 



EPC009b5607pe//S.\PATENT FILESVV xs Systems (1459 JGL;WIXS Pa:en{s\ViXS010C\References\EPCO09 -.SbO' cpq 



Page 12 oi 12 



EP 0 955 607 A2 



XX X X X X X X 
x\x X X X X X 
X x\c X X X X X 
x x xSx X X X X 

xxxirxxxx 

XXXXXXXX 

xxxxxxxx 

XXXXXXXX 



FIG. 4A 



X X XIX X X X X 

y y yiy y y y y 

xxxxxxxx 

yyyyyyyy 

xxxxxxxx 

yyyyyyyy 

xxxxxxxx 

yyyyyyyy 

FIG. 5A 




FIG. 4B 

x IX x x 

y y y y 

X X X X 

y y y y 

FIG. 5B 



12 



